Faster Joins, Self Joins and Multi-Way Joins Using Join Indices
نویسندگان
چکیده
We propose a new algorithm called Stripe join for performing a join given a join index Stripe join is inspired by an algorithm called Jive join developed by Li and Ross Stripe join makes a single sequential pass through each input relation in addition to one pass through the join index and two passes through a set of temporary les that contain tuple identi ers but no input tuples Stripe join performs this e ciently even when the input relations are much larger than main memory as long as the number of blocks in main memory is of the order of the square root of the number of blocks in the participating relations Stripe join is particularly e cient for self joins To our knowledge Stripe join is the rst algorithm that given a join index and a relation signi cantly larger than main memory can perform a self join with just a single pass over the input relation and without storing input tuples in intermediate les Almost all the I O is sequential thus minimizing the impact of seek and rotational latency The algorithm is resistant to data skew It can also join multiple relations while still making only a single pass over each input relation Using a detailed cost model Stripe join is analyzed and compared with competing algorithms For large input relations Stripe join performs signi cantly better than Valduriez s algorithm and hash join algorithms We demonstrate cir cumstances under which Stripe join performs signi cantly better than Jive join Unlike Jive join Stripe join makes no assumptions about the order of the join index
منابع مشابه
Processing Sliding Window Multi-Joins in Continuous Queries over Data Streams
We study sliding window multi-join processing in continuous queries over data streams. Several algorithms are reported for performing continuous, incremental joins, under the assumption that all the sliding windows fit in main memory. The algorithms include multiway incremental nested loop joins (NLJs) and multi-way incremental hash joins. We also propose join ordering heuristics to minimize th...
متن کاملOn Multi-way Spatial Joins with Direction Predicates
Spatial joins are fundamental in spatial databases. Over the last decade, the primary focus of research has been on joins with the predicate “region intersection.” In modern database applications involving geospatial data such as GIS, efficient evaluation of joins with other spatial predicates is yet to be fully explored. In addition, most existing join algorithms were developed for two-way joi...
متن کاملAre Multi-way Joins Actually Useful?
Multi-way joins improve performance by avoiding extra I/Os from multiple partitioning steps. There are several multi-way join algorithms proposed, and the research results are encouraging. However, commercial database systems are not currently using multi-way joins. Practical issues include modifying the optimizer and execution system to support multi-way operators and ensuring robust and relia...
متن کاملEfficient Temporal Join Processing Using Indices
We examine the problem of processing temporal joins in the presence of indexing schemes. Previous work on temporal joins has concentrated on non-indexed relations which were fully scanned. Given the large data volumes created by the ever increasing time dimension, sequential scanning is prohibitive. This is especially true when the temporal join involves only parts of the joining relations (e.g...
متن کاملGraph Partition Based Muti-Way Spatial Joins
In this paper, we investigate the problem of efficiently computing a multi-way spatial join without spatial indexes. We propose a novel and effective filtering algorithm based on a two phase partitioning technique. To avoid missing hits due to an inherent difficulty in multi-way spatial joins, we propose to firstly partition a join graph into sub-graphs whenever necessary. In the second phase, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Data Knowl. Eng.
دوره 29 شماره
صفحات -
تاریخ انتشار 1997